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METASTABILITY FOR THE CONTACT PROCESS ON THE 
PREFERENTIAL ATTACHMENT GRAPH 

VAN HAO CAN 


Abstract. We consider the contact process on the preferential attachment graph. The 
work of Berger, Borgs, Chayes and Saberi |BBCS1| confirmed physicists predictions that 
the contact process starting from a typical vertex becomes endemic for an arbitrarily 
small infection rate A with positive probability. More precisely, they showed that with 
probability it survives for a time exponential in the largest degree. Here we obtain 

sharp bounds for the density of infected sites at a time close to exponential in the number 
of vertices (up to some logarithmic factor). 

1. Introduction 

The paper aims at proving a metastability result for the contact process on the prefer¬ 
ential attachment random graph, improving |BBCSl] ’s result in two aspects: obtaining a 
better bound on the extinction time, and estimating more accurately the density of the 
infected sites. 

The contact process is one of the most studied interacting particle systems, see in 
particular Liggett’s book jQ, and is also often interpreted as a model to describing how a 
virus spreads in a network. Mathematically, it can be defined as follows: given a locally 
finite graph G = (V, E) and A > 0, the contact process on G with infection rate A is 
a Markov process on {0,1}'^. Vertices of V (also called sites) are regarded as 

individuals which are either infected (state 1) or healthy (state 0). By considering as a 
subset of V via = {n : ^t(n) = 1}, the transition rates are given by 

^ ^t \ {n} for n G at rate 1, and 

6 ^ 6 U {n} for V ^ at rate A deg^^(n), 

where deg^^ (n) denotes the number of edges between v and other infected sites (Note that 
if G is a simple graph, i.e. contains no multiple edges, then deg^j(n) is just the number of 
infected neighbors of v at time t). Given that A G V, we denote by (^/^)t>o the contact 
process with initial configuration A. If A = {n} we simply write {^t)- 

Originally the contact process was studied on integer lattices or homogeneous trees. 
More recently, probabilists started investigating this process on some families of random 
graphs like the Galton-Watson trees, conhguration models, random regular graphs, and 
preferential attachment graphs, see for instance [El IGDl IGSl [Dl IMVYl IMVl IBBGSlj . 

The preferential attachment graph (a dehnition will be given later) is well-known as a 
pattern of scale-free or social networks. Indeed, it not only explains the power-law degree 
sequence of a host in real world networks, but also reflects a wisdom that the rich get 
richer - the newbies are more likely to get acquainted with more famous people rather 
than a relatively unknown person. Therefore there has been great interest in this random 
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graph as well as the processes occuring on it, including the contact process. In |BBCS1] . 
by introducing a new representation of the graph, the authors proved a remarkable result 
which validated physicists predictions that the phase transition of the contact process 
occurs at A = 0. More precisely, they showed that there are positive constants 6, c and 
C, such that for all A > 0 

(1) (Cxp(eA2v^) 7^ 

where is the contact process starting from a uniformly chosen vertex. 

Recently, in |BBCS2) they used their new representation to show that the preferential 
attachment graph converges weakly to a limit, called the Polya-point graph. 

In this paper we will use this convergence as well as the new representation to improve 
o. Here is our main result. 

Theorem 1.1. Let {Gn) be the sequential model of the preferential attachment graph with 
parameters m > 2 and a G [0,1). Consider the contact process {^t) with infection rate 
A > 0 starting from full occupancy on Gn- Then there exist positive constants c and G, 
such that for A small enough, 

(2) P„ ^cA^+^llogAp^ < ^ < CA^+^llogAp^^ = l-o(l). 

where and (tn) is any sequence satisfying tn ^ oo and tn <Tn = exp ^ (log^p/V’ ) • 

By a well-known property of the contact process called self-duality (see [Q, Section I.l) 
for any f > 0 we have 

(3) Elite” = 

V&Vn 

Therefore the survival probability as in ([1]) is just the expected value of the density of 
infected sites as in Theorem ll.il so that our result is a stronger form of the one in |BBCS1] . 
Additionally we get a more precise estimate of the density and we allow [tn) to be larger. 
Let us comment on its proof now. 

First, to obtain the time Tn we will use the maintenance mechanism as in cm instead 
of the one in |BBCS1] . In the latter the authors used that in the preferential attachment 
graph the maximal degree is of order y^, plus the well-known fact that for any vertex 
V, the process survives a time exponential in the degree of v, once it is infected, yielding 
(fT|). In the former, on the other hand, when considering the contact process on the 
conhguration model, Chatterjee and Durrett employed many vertices with total degree of 
order n^~^, for any e > 0, and derived a much better bound on the extinction time. Here, 
our strategy is to hnd vertices with degree larger than Gd{Gn), where G = G{X) > 0 is a 
constant and d[Gn) is the diameter of Gn, which is of order logn. Thanks to Proposition 1 
in cm. we can deduce that the virus propagates along these vertices for a time exponential 
in their total degree. Moreover, the degree distribution of the graph, denoted by p, has 
a power-law with exponent n = 2 + l/fj- Thus the number of these vertices is of order 
n(log?7,)^“^ and their total degree is of order ?7,(logn)“^/'*^, which explains the bound on 
tn in Theorem ll.il 

It is worth noting that for any graph with order n edges, like Gn, the extinction time 
of the contact process is w.h.p. smaller than exp(C'n), for some G > 0. Hence our bound 
on tn is nearly optimal. 
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Secondly, to gain the precise estimate on the density, we use ideas from [Pi IBBCSll ICDl 
IMVYj : if the virus starting at a typical vertex wants to survive a long time, it has to infect 
a big vertex of degree significantly larger than A“^. Then the virus is likely to survive 
in the neighborhood of this vertex for a time which is long enough to infect another big 
vertex, and so on. We can see that the time required for a virus to spread from one big 
vertex to another is at least (corresponding to the case when the distance between 

them is constant). Besides, it was shown that if deg(n) > K/\^, then the survival time 
of the contact process on the star graph formed by v and it neighbors is about exp(ciP). 
Hence the degree of big vertices should be larger than CA'^I log A|. Then we consider A, 
the set of vertices which have a big neighbor. The probability for a vertex in A to infect 
its big neighbor is of order A. Moreover, we will show in Section 4 that any big vertex 
has a positive probability to make the virus survive up to time T„. This means that the 
probability for the dual process starting from any vertex in A to be active at time is 
of order A. Furthermore, we will see that these events are asymptotically independent. 
Therefore the density of vertices from where the dual process survives up to T„ is about A 
times the density of A. This is of order A x p([A ^1 log A|, co))A ^1 log A| X I log A| v- 
yielding the desired lower bound. 

We notice that the density of big vertices is of order p([A“^| log A|, cxd)), which is not 
optimal. Hence we need to consider also their neighbors. In fact this idea of using A was 
first introduced in laa for the configuration model (CM). 

For the upper bound, we look at the local structure of Gn- In particular, we will show 
that its weak limit, the Polya-point graph, is locally dominated by Galton-Watson trees. 
Then the results in |MVYj will be applied to deduce our desired bound. 

It is interesting to note also that if we consider the contact process on the CM with the 
same power-law degree distribution, the density is of order | log A|~^ (see |MA/Yl 
Theorem 1.1]), which is slightly smaller than the one in ([2]). This difference is due to the 
fact that the distance between big vertices in the CM is about | logA|, instead of constant 
here. 

Finally, the above strategy works properly when studying the contact process on the 
Polya-point graph. In fact, proving the following result helped us pave the way to potential 
solutions for Theorem o 

Proposition 1.2. Let (^°) be the contact process on the Polya-point graph with infection 
rate A > 0 starting from the root o. There exist positive constants c and C, such that for 
A small enough, 

cA^+^/’^l \ogX\-^/^ < P(^° ^ 0 Vf) < C'A^+^/’^l \ogX\-^/^. 

Theorem 11.11 (resp. Proposition II.2p implies that for all A > 0, the contact process 
becomes endemic (resp. survives forever) with positive probability. We say that the 
critical values of the contact process on the preferential attachment graph and its weak 
limit are all zero. This is a new example of a more general phenomena that there is 
a relationship between the phase transition for the contact process on a sequence of 
finite graphs and the one on its weak local limit in the sense of Benjamini-Schramm’s 
convergence. Here are some known results on this topic: the contact process on the 
integer lattice and on finite boxes |1, n]'^ exhibit a phase transition at the same critical 
value Ac = Xc{d), see [Q Part I] for all d > 1; the phase transition of the process on the 
random regular graph of degree d and its limit, the homogeneous tree T^, occurs at the 















4 


VAN HAO CAN 


same constant Ai(Td), see |MV) : the phase transition of the contact process on (the 
c?-homogeneous tree of height t) and its limit, the canopy tree CT^, happens at A 2 (Trf), 
see |CMMVl IMV] : the critical value of the contact process on the conhguration model 
with heavy tail degree distributions or on its limit, the Galton-Watson tree, is zero, see 

[Pi ICDl IMVYl IMMVYl IC^ . 

Now the paper is organized as follows. In the next section, based on |BBCS2| . we give 
the dehnition of the sequential model of the preferential attachment graph as well as its 
weak local limit, the Polya-point graph. We also prove preliminary results on the graph 
structure and hx some notation. In Section 3, we prove Proposition [T^ Finally, the main 
theorem is proved in Section 4. 


2. Preliminaries 

2.1. Construction of the random graph and notation. Let us give a dehnition 
following |BBCS2] of the sequential model of the preferential attachment graph with 
parameters m > 2 and a G [0,1). We construct a sequence of graphs (G„) with vertex 
set Vn = {ui,..., Vn} as follows: 

First Gi contains one vertex vi and no edge, and G 2 contains 2 vertices ^ 1,^2 and m 
edges connecting them. Given G„_i, we dehne the following way. Add the vertex 
Vn to the graph, and draw edges between and m vertices tCnp,..., Wn,m (possibly with 
repetitions) from Gn-i as follows: with probability an\ the vertex Wn,i is chosen uniformly 
at random from Vn-i- Otherwise, Wn,i = Vk with probability 

degi(li(nfc) 

^n-l 

where 

(i) f ^ 

= I ^_ 2m(n-l) _ 

L 2m{n—2)+2ma-\-{l—a){i—l) 

degj(li(nfc) = deg„_i(nfc) + #{1 < j < i - 1 : Wn,j = Vk}, 
is the degree of Vk before choosing Wn^i, and 

n—1 n—1 

Z^n-i = degi*li(ufc) = degn-i(^^fc) +i-I = 2m(m - 2) + * - 1, 

k=l k=l 

with deg„_]^(nfc) the degree of Vk in Gn-i- 

This construction might seem less natural than in the independent model where with 
probability a we choose Wn,i uniformly from W-i and with probability 1 — a it is chosen 
according to a simpler rule: Wn,i = Vk with probability deg„_^(nfc)/2m(m — 2). However 
the sequential model is easier to analyze because it is exchangeable, and as a consequence 
it admits an alternative representation which contains more independence. In |BBGS2] . 
the authors called it the Polya urn representation which we now recall in the following 
theorem. To this end, we denote by /3(a, b) the Beta distribution, whose density is propor¬ 
tional to x““^(l — x)^~^ on [0,1], and by F(a, 6) the Gamma distribution, whose density 
is proportional to on [0,oo). For any a < b, U{[a,b]) stands for the uniform 

distribution on [a,b]. 


if i = 1, 
0{n~^) if i > 2, 
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Theorem 2.1. |RR(::S2[ Theorem 2.1.] Fix m > 2, a G [0,1) and n > 1. Set r = 
a /(1 — a), "01 = 1 , and let 'ijj 2 ,..., be independent random variables with law 


ijjj ~ /3(m + 2mr, (2j — 3)m + 2mr{j — 1)). 

Define 

n k 

Fj = i^j n(l-V’t), Sk = '^^Pj, and h = [Sk-i, Sk). 

t=j+i j=i 

Conditionally on fii,... ^ipn, IC {Uk,i}k=i,...,n,i=i,...,m be a sequence of independent random 
variables, with Uk,i ~ W([0, S'fc-i]). Start with the vertex set Vn = {Ti,...,Tn}- For 
j < k, join Vj and Vk by as many edges as the number of indices i G {1,..., m}, such that 
Uk,i G Ij. Denote the resulting random graph by Gn- 

Then Gn has the same distribution as the sequential model of the preferential attachment 
graph. 


From now on, we always consider the random mnlti-graph Gn constructed as in this 
theorem. 


We now look at the local structure of Gn- It was shown in |RRCS2] that Gn is locally 
tree-like, with some subtle degree distribution that we now recall. First we fix some 
constants: 


1 -I- 2r 

2 + 2r 


and 


= 


1 -X 
X 


1 

1 -|- 2r 


Note that 1/2 < y < 1 and 0 < < 1. Let F ~ F(m-|-2mr, 1) and F' ~ F(m-|-2mr-|-l, 1). 

We will construct inductively a random rooted tree (T, o) with vertices identified with 
elements of (where vertices at generation i are elements of and a map which 

associates to each vertex v a position x^ in [ 0 , 1 ]. Additionally each vertex (except the 
root) will be assigned a type, either R or L. 


• The root o = (0) has position Xq = Uq, where Uq i/(| 0 ,l|). 

• Given v E T and its position x^, define 


j m if n is the root or of type L, 

^ m — 1 if n is of type R. 


and 


( F if n is the root or of type R, 
\ F' if n is of type L. 


The children of v are (n, 1),... ,(n, my), (n, m^-|-l),... ,(n, my+qy), the first m^’s are 
of type L and the remaining ones are of type R. Conditionally on Xy, x^y^i), 
are i.i.d. uniform random variable in [ 0 ,a:^], and ■ ■ ■, are the 

points of the Poisson point process on [xy, 1] with intensity 


py{x)dx 


Tpx'^ ^ , 
'Jv — - — dx. 


x 




This procedure defines inductively an infinite rooted tree (T, o), which is called 
the Polya-point graph and {xy)y^T is called the Polya-point process. 

For any vertex n in a graph G and any integer R, we call Bg{v, R) the ball of radius R 
around v in G, which contains all vertices at distance smaller than or equal to R from v 
and all edges connecting them. 
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Theorem 2.2. |BBCS2[ Theorem 2 . 2 .] Assume that the random graph Gn is construeted 
as in Theorem \2.1[ Letu he a vertex chosen uniformly at random in Gn and let R be some 
fixed constant. Then Bg^{u,R) converges weakly to the ball Bt{o,R) in the Polya-point 
graph. 


Now we introduce some notation. We call P„ a probability measure on a space in which 
the random graph Gn is defined together with the contact process. Since we will fix A, 
we omit it in the notation. We also call P a probability measure on a space in which the 
Polya-point graph as well as the contact process are defined. 

We denote the indicator function of a set A by 1(A). For any vertices v and w we write 
n ~ tc if there is an edge between them (in which case we say that they are neighbors or 
connected), and v w otherwise. We call size of G the cardinality of its set of vertices, 
and we denote it by |G|. 

A graph in which all vertices have degree one, except one which is connected to all the 
others is called a star graph. The only vertex with degree larger than one is called the 
center of the star graph, or central vertex. 

If / and g are two real functions, we write / = 0{g) if there exists a constant G > 0, 
such that f{x) < Gg{x) for all x; f > g (or equivalently g < f) A g = G(/); / x 51 if 
/ = 0{g) and g = G(/); / = o{g) if g{x)/f{x) —)■ 0 as x —)■ 00 . Finally for a sequence of 
r.v.s {Xn) and a function / : N —)■ ( 0 , 00 ), we say that Xn x f[n) holds w.h.p. if there 
exist positive constants c and G, such that Pn(c/(n) < Xn < Gf{n)) 1. 


2.2. Preliminary results on the random graph. We first recall a version of the 
Azuma-Hoeffding inequality for martingales which we will use throughout this paper (see 
for instance CQ). 


Lemma 2.3. Let (Xj)j>o be a martingale satisfying \Xi — Xj_i| < 1 for all i > 1. Then 
for any n and t > 0, we have 

P {\Xn — Wol >t)< 2exp(—f^/2n). 


From this inequality we can deduce a large deviations result. Let (W)i>i be a sequence 
of independent Bernoulli random variables. Assume that 0 < 2p < E„(Xj) < Mp for all 
i. Then there exists c = c(M) > 0, such that for all n 


(4) 


P 


np < Xi < 2Mnp 


i=l 


> 1 — exp(—cnp). 


Now we present some estimates on the sequences {(fi), (fij) and (Sk). 


Lemma 2.4. Let {4>j)j and {Sk)k be sequences of random variables as in Theorem 

o Then there exist positive constants p and 6 q, such that for all 9 < 9 q, the following 
assertions hold. 


(i) E„(^,-) = f+ ^(^), 


(ii) For any e > 0, there exists K = K{e) < 00 , such that 

> 1 - 

where 

£-e = {l-Ffc — {k/n)^\ < e{k/n)^ ^K{e) < k < n}. 
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(iii) For any i < j, define = n (1 — 'ft)- Then 

t=i+l 


En(sF’) < (iU f ■ 


(iv) (/i/j > -fj > 9/j) > 29. 


(v) > 9/j)) > 9En{ipj). 

Proof. Let us start with Part (i). Observe that if ~ I3{a,b), then 


E{fj) = 


a + b 


and E{'f'^) = 


a(^a 1 ) 


(fl + 6 ) (fl + 6 + 1 ) 

Hence the result follows from the fact that 'ipj ~ /3(m + 2mr, (2j — 3)m + 2mr{j — 1)). 

Part (ii) is exactly Lemma 3.1 in |BBCS2| . We now prove (iii). From the first observa¬ 
tion in Part (i), we have Eni'ft) ^ x/t t > 1. Moreover, the {'ft) are independent, 

hence we get that 


E„(s®)= n (i-E„(<f,))< n (i 


t=i+l 

^ n (1 

t=i+l 


]^\ X 

t 


t=i+l 

3 


Here we have used that if 0 < y, a; < 1, then (1 — yx) < (1 — x)^. 
For Part (iv), Markov’s inequality gives that for any 6 G (0,1) 

Yaxni'fj] 


(Iflj - En{f>j)\ > (1 - 6)En{f>j)) < 


(1-<5)2E„(V^,)2- 


Moreover, if -0 ~ (3 {a, b), then 
Var(V>) E{ip‘^) 


E(V>)2 E(V^)2 

Therefore for any j 


1 = 


(a -|- 1) (a -|- b) 


a{a -|- 6 -|- 1 ) 
Eni'fj e {6En{f>j), (2 - 6)En{f>j))) > 1 - 


1 = 


< 


a{a -|- 6 -|- 1 ) a 
1 


(1 — S)^(m -|- 2mr) 

Hence thanks to (i) we can choose positive constants jj, and 6*, such that for all j 


(5) 


P.. I e I jT I I > 29, 


For (v), we notice that 

En{y:>jl{'fj > 9/j)) = En{'fjl{'fj > 6'/j))E„ ( JJ 


\t=j+i 


> cEn{'fj)En ( JJ (1 - i^t) 

Kt=j + l 


cEji {^j) , 

for some c > 0, independent of j. Thus the result follows by taking 9 small enough. □ 
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The preferential attachment graph is known as a prototype of small world networks 
whose diameter and typical distance (the distance between two randomly chosen vertices) 
are of logarithmic order. In fact, these quantities in the independent model were well- 
studied, see for instance [DVHl or [^. In the following two lemmas, we prove similar 
estimates for the sequential model. These estimates are in fact weaker but sufficient for 
our purpose. 


Lemma 2.5. Let d{Gn) be the diameter of the random graph G„, i.e. the maximal distance 
between pair of vertices in Gn- Then there exists a positive constant bi, such that 

Pn(d(G„) < feilogn) = 1 - o(l). 


Proof. Let £ G (0,1/2) be given, and recall the definitions of K{e) and given in Lemma 
a (ii). We first bound d{vi,Vn)- Define a decreasing sequence (rii)j>o by no = n, and 
the condition that and are neighbors (note that the choice of (n*) is not unique 
in general). Define 


Xi = l{{ni < ?7,j_i/2}) and = aiprij : j < i) V 


Denote by 


an = inf{i ; rij+i < logn}. 


If i < an-i then Ui > logn > 2K{e). Therefore due to the construction of the graph, for 
such i we have on £i. 


< ni /2 I Xi) 


^ I- e ^ 1 

- (1 + £)2X - 2X+2 


p > 0. 


In other words, on 8^ we have 


( 6 ) 


E„(Xi+i I Ti)Hi < ff„) > pl(i < ff„). 


Let 

k 

W = 5^(W-E,,(W I J-,_i)). 

Then (T/) is a martingale with respect to the filtration (Xk) and \Yk — Yk-i\ < 1. By 
using Lemma 12.31 we get that 

Pn (Dfc < -kp/2 I {(ft)) < 2exp{-kp‘^/8), 


which implies by using ([6]) that on 


(7) 




< kp/2, an > k I {ipt) I < 2exp(-fcp^/8). 


. i=l 


Moreover, if ~ log 2 log^)) then < logn (or equivalently an < k). 

Hence it follows from ([7]) that on 8^ 


Pn(0-n > Glogn I (pf)) < 


^ C log n 

^ w < log 2 n - log 2 logn 


{Pt) = 0{n-^), 


2=1 


for some G = G(p) > 0. 

On the other hand, if cr„ < Glogn, then d{vi,Vn) < (G -|- 1) logn. We thus obtain that 
on 8e 


(8) 


Fn{d{vi,Vn) > (G -h 1) logn I (ift)) = 0{n ^). 
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Let dGi,{vi,Vj) be the distance between Vi and Vj in Gk for iG ^ k < n. Note that 

dG^{vi,Vj) > d{vi,Vj) = dG^{vi,Vj). 

Similarly to ([S]), we deduce that on for all i > C\ogn, 

¥n{d{vi,Vi) > (C + 1) logi I (v9t)) < ¥i{dGi{vi,Vi) > (C + 1) logi | {^pt)) = 

Hence on 

¥n{d{vi,Vi) < (C + 1) logn Vi > Clogn \ {ipt}) = 1 - o(l). 

Therefore by taking expectation with respect to {(pt) and using Lemma [2.41 (ii), we get 

Pn(d(Gn) < 2{C + 1) logn) > 1 - 2e, 

which proves the result by letting e tend to 0. □ 


Before proving the lower bound on the typical distance, we make a remark which will 
be used frequently in this paper. It follows from the definition of Gn that for all i < j, 


Pn(ni 7^ Vj I (cpt)) = 1 - 






Hence 


^ ~ V, I (cpt)) < 

*-> 7-1 ^i-1 


Then by using the following identities 


i-i 


Sj-i = '^P’t = JJ(1 - V't) and <Pi = 'ilJi JJ (1 - A), 

t=l t=j 


we obtain that 

(9) '07*5'?"^^ < Pn(ni ~ Vj I ( 9 ?t)) < miJiS, 

where we recall that 

kt) 


L-i) 

i ’ 




t=i+l 


Lemma 2.6. Let wi and W 2 be two uniformly chosen vertices from foi- Then there is a 
positive constant fo, such that w.h.p. 

fologn 

^^ 2 ) > -j—j-• 

log log n 



Proof. We will use an argument from [Vl Lemma 7.16]. We call a sequence of distinct 
vertices vr = (tti, ..., tta,) a self-avoiding path. We write vr C Gn if and Tj+i are 
neighbors for all 1 < i < /c — 1. Let n(q j, k) be the set of all self-avoiding paths of length 
k starting from Vi and finishing at Vj. We then claim that for all i,j, k > 1, 

(i) Fn{d{vi,Vj) = k) < gkiij) := E C Gn), 

n£Tl{iJ,k) 

(ii) gk+i{i,j) < E 9i{hs)gk{s,j). 
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The first claim is clear, because if d{vi,Vj) = k then there exists a self-avoiding path in 
which is in Gn- For the second one, we note that for any self-avoiding path 

71 , . . . , TT/j), 


P„(7r C Gn) = IPn(7ri ~ 7r2)Pn(7r C Gn), 

where vf = ( 712 ,..., 77k)- Indeed, if j < k, then the event that Vj ~ Vk depends only on the 
{Uk,i)i<m- Hence this result follows from the facts that the vertices in vr are distinct and 
that the {{Uk,i)i<m}k are independent. We are now in position to prove (ii): 

9k+i{i,j) = ~ C Gn) 

Vi^TT 

< IPn(r’i ~ r’s)IPn(7r C Gn) 

s^i,j 7T£ll{sJ,k) 

= 9i{hs)gk{s,j). 


We prove by induction on k that there is a positive constant G, such that 


( 10 ) 


9k{i,j) < 


{Glogn)^ 


For k = 1, thanks to ([Oj) and Lemma [2.41 (hi), for all i < j we have 


(11) 


= Pn(Ki ~ Kj) 

< mEn{i7i)En{St^^) 


< mEniTpi) 


G 



X 


for some C > 0. The existence of G follows from the facts that E„('0j) x 1/i and y > 1/2. 

Assume now that the result is true for some fc, and let us prove it for k + 1. By using 
the induction hypothesis, (ii) and (ITT]) we get that 


9k+i{i,3) < ^9i{hs)gk{s,j) 
s¥=i,j 




G {Glogn)^ 
Vis y/V) 


(Clogn)^+^ 

VU ’ 
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which proves the induction step. Now it follows from (i) and dTOj) that 

1 ^ 

Fn{d{wi,W2) <K)< — E E 9k{i,j) 

k=l 



E E 

k=l 


{C\ognY 


^ (Clog 77 ,)^+^ 

“ n 

Therefore if = logn/( 2 Cloglog? 7 ,), then Fn{d{wi,W 2 ) < K) = o(l). 
desired lower bound with 62 = 1/(2C). 


This gives the 
□ 


Remark 2.7. The bound in the above lemma is probably not sharp. Indeed for the 
independent model, it is proved in |DVH] or [V] that if a > 0 (or equivalent y > 1/2), 
then w.h.p. d{wi,W 2 ) > clogn; and otherwise, w.h.p. d{wi,W 2 ) > logn/(C + log log n), 
for some positive constants c and C. 


2.3. Contact process on star graphs. We will see that star graphs play a crucial 
role in the conservation of the virus on the preferential attachment graph. Hence, it is 
important to understand the behavior of the contact process on a single star graph as 
well as the transmission between them. These have been studied for a long time by many 
authors, for instance in [Pl IBBCSll ICDl IMVY| . The results we need will be summarized 
in Lemma and below, but hrst a dehnition: we say that a vertex v is lit (the term 
is taken from EH) at some time t if the proportion of its infected neighbors at time t is 
larger than A/(16e). 

Lemma 2.8. Let {^t) be the contact process on a star graph S with center v. There exists 
a positive constant c*, such that the following assertions hold. 

(i) P {v is lit at time 1 | ^o('i^) = 1 ) > c*(l — exp(—CAIS'D). 


(ii) F {3t > 0 : V is lit at time t \ ^o{.v) = 1) ^ 1 as [S’! —)■ cxo. 

(hi) If X‘^\S\ > 64e^, and v is lit at time 0, then v is lit during the time interval 
[exp(c*W|S'|), 2 exp(c*A^|S'|)] with probability larger than 1 — 2 exp(—CADIS'!). 

Proof. Parts (i) and (ii) are exactly Lemma 3.1 (i), (hi) in |MVY] . For (iii) we need an 
additional dehnition: a vertex v is said to be hot at some time t if the proportion of 
its infected neighbors at time t is larger than A/( 8 e). Then in |CD| the authors proved 
(with different constants in the dehntion of lit and hot vertices, but this does not effect 
the proof) that 

• if n is lit at some time t, then it becomes hot before t + exp(c*A^ deg(n)) with 
probability larger than 1 — exp(—c*A^ deg(n)), 

• if n is hot at some time t, then it remains lit until t + 2 exp(c*A^ deg(n)) with 
probability larger than 1 — exp(—c*A^ deg(n)). 

Now (iii) follows from these results. □ 

The following result is Lemma 3.2 in |MVY) . 

Lemma 2.9. Let us consider the contact process on a graph G = {V,E). There exist 
positive constants c* and Aq, such that ifO<X< Aq, the following holds. Let v and w be 
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two vertices satisfying deg(u) > log (^) d{v, w). Assume that v is lit at time 0. Then 
w is lit before exp(c*A^ deg(w)) with probability larger than 1 — 2exp(—c*A^ deg(u)). 


3. Proof of Proposition 11.21 


In this section we stndy the contact process on the Polya-point graph {T,o). In fact 
like with other examples mentioned in the introdnction, we will see in the next section 
that the results and proofs on this graph will give us some insight in dealing with the 
original finite graph’s problem. 

We first make some observation on the degrees of the neighbors of the root (0). We 
denote by wq = (0), and xq = Xwq- For any i > 1, let 

Wi = ( 0 , 1 ,..., 1 ) and xt = Xw^. 


Then degree conditioned on Xi is m+1 plus a Poisson random variable with parameter 


T, 


1_ 


'ifx'^ ^dx = 7 


1-xl 


xj 




where 7 is a Gamma random variable with parameters a = m + 2mr + 1 and 1. Therefore 
letting K = (1 — xf)/xf, we have 


q{k I Xi) := P(deg(tai) = m + l + k\xi) = E 

( 12 ) 

where T{b) = x^~^e~^dx. 


3 - 7 K 


k\ 

r(fc + a) 
r(a)A:! 


(7«) 


Xi = 


{l-xt)^xf, 


K^r(A: + a) 

[K + l)^+^T{a)k\ 


3.1. Proof of the upper bound. The idea of this part is to show that locally the Polya- 
point graph can be viewed as a subgraph of a certain Galton-Watson tree and then apply 
some proofs from |MVY] . For all i and k > 0, we denote by 

qfk) = P(deg(u;i) = m + 1 + k). 

The following lemma gives estimates on the tail distributions of these laws. 


Lemma 3.1. There exist positive constants c and C, such that for all k > 0 

(13) ck~‘^~^^^ < qo{k) < Ck~‘^~^^'^ and 

(14) ck~^~^^'^ < qi{k) < C{\ogkf~^k~^~^^'^ for all i>l. 

Moreover, the sequence {qf) is stochastically increasing i.e. qi Y for all i > 0. 


Proof. Assume that i > 1. Due to the construction of the Polya-point process, Xi is 
distributed as Ui.. where {Uj)j>o is a sequence of i.i.d. uniform random 

variables in [0,1]. Hence by changing the order of integration, 


qfk) = E(P(deg(M;j) = m + 1 + k \ Xi)) 


= I + l)xtdxo I —...I 
Jo Jo Xo Jo 

= (fj + l) [ q{k I Xi)dxi [ 

Jo J Xi Xi-i 


I'XQ 


dxi 


CXi-2 


dxi-i 

Xi-2 Jo 

dxi 

Jx2 . 


'■Xi-l 


q{k I Xi) 


dxi 

Xi-l 


Xq ^dxo- 


(15) 
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By induction we can show that 


g{xi) := 


dxi-i 

34 ^i—1 


r dxi 

1x2 


x^ ^dxo = (- 1 ) 


i-l 


l-a;f (logxiy 


" 0 * 


(16) 


(- 1 ) 


2—1 




1-^f + E 


{log xfy 


3 
i—1 


i=i 


f- 


By combining ffT^ . (fT5|) . (fl^ and using the change variables y = xf, we obtain 


(17) 

where 


Qi{k) = 


{y + l)T{k + a) r 
'ijjT{a)k\ Jq 

(-l)-i 


{l-y)V^^-^h{y)dy, 


i—1 


h{y) = 




1-9 + E 


(log yy 


1=1 


Since g{-) is decreasing, so is the function h{-). Therefore 


(18) 


(1 - y)V*^-'h(y)dy > h(l/2) I (1 - y)‘‘y‘**-‘dy 


.1/2 




On the other hand for k > a — 1 + l/y, we have 
"1 , ri/s 


' 1/2 


(1 - y)V^^~"dy = 


yyi-yy+h^dy< I {1 - y)Y^^~'dy. 


»l/2 




'0 


Hence, 


.1/2 


{l-y)V^^~"dy> f {l-y)Y^^~"dy. 


Combining this with flTSl) yields that 


(19) 


/ (1 - yfv'‘*r%(y)dy > h{ll2)B(k + 1,0 + 1/V-), 
Jo 

where B{a, b) = (1 — yY~^y^~^dy = Note that 

r(fc + a) 


( 20 ) 


r(A)) 


k° 


By using this in ffT7|) and flT^ we deduce that 

<li(k) > I-*-'/*, 

which implies the lower bound in (ITTj) . For the upper bound, observe that 


log2/K(l-2/)V^’^ ^dy< 


"i/fc 


logyPi/“+i ^dy+ / |log|/P(l-|/)V^^ ^dy 
Jllk 


< c{\ogkyk-^-^/^, 
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for some C = C{j) > 0, where we have used that the function | is increasing 

in a neighborhood of the origin to bound the first term and the approximation fl 2 U]) for 
the second term. On the other hand by using fl2U]) again, we get 

Now the result follows from these estimates and by using fl2U]) again for the term in front 
of the integral in f[T7|) . 

When i = 0, the proof is simpler, as the distribution of xq is just Uq. We safely leave 
it to the reader. 

For the last assertion, we note that deg(tai) is m + 1 plus a Poisson random variable with 
parameter 7(1 — xf)/xf, with 7 ~ r(m + 2mr + 1,1). On the other hand, Xj+i < Xi for 
all z > 0 (as Xj+i ~ Z//([0,Xj])). The result now follows from the well-known monotonicity 
property of Poisson random variables. □ 

Remark 3.2. The logarithmic correction in (fTT|) can not be droped, at least not com¬ 
pletely. For instance, it follows from f[T7|) and fl20]) that 

l-l/k ^ 

Yk) > / (I logi/l + 1 / - 1)(1 - yfy''^^~^dy 

Jl/2k 

> logk. 

We are now ready to show the relation between the Polya-point graph and Galton- 
Watson trees. To do that we introduce some new notation and dehnition. Given distri¬ 
butions p, p', p", we denote by GW(p, p', p") the Galton-Watson tree dehned as follows: 
the root o has degree distriution p, all children of o have degree distribution p' and all 
other vertices have degree distribution p". Then for i > 2, we let 

(Ti,o) = GkF(qo, qi,qi-i), 

where we recall that q, is the degree distribution of Wi. 

Let (Ti,oi) and (T 2 , 02 ) be two random rooted trees. We say that (Ti,oi) is stochasti¬ 
cally dominated by (T 2 , 02 ), and write it (Ti,oi) ^ (T 2 , 02 ), if there exists a coupling of 
the two trees such that a.s. Ti is a subgraph of T 2 and oi = 02 - 

Lemma 3.3. For any integer i > 2, the ball in the Polya-point graph of radius i around 
the root is stochastically dominated by the corresponding ball in the Galton-Watson tree 
GlF(qo,qi,qi-i), he. Bt{woG) :<BYoG)- 

Proof. We hrst observe that 

(i) By construction of the Polya-point graph, for any n > 0, conditionally on the 
positions of the vertices at generation n (i.e. at distance n from wq), their degree 
distributions are independent. 

(ii) If V and w have the same type and Xy ^ Xy,, then deg(t(;) ^ deg(n). Indeed, 
this follows from the construction of the graph and the monotonicity property of 
Poisson random variables. 

(hi) If ta = (v,£) with £ > my, then deg(tc) ^ deg(n). Indeed, since w is of type R, 
we have Xy, > Xy and 7 ^ ^ 7 ^ (recall that F(a, 1) is stochastically increasing in a, 
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and thus F ■< F'). Hence the claim follows as well from the monotonicity property 
of Poisson random variables. 

We now prove the lemma by induction on i. Let us start with i = 2. The claim follows 
from the following facts: 

• The degree distribution of the root is Qq. 

• If n = (0,£) is some child of the root of type L (i.e. with i < m), then v has the 
same degree distribution as (0,1), which is qi. Otherwise by using (iii) above, we 
get that deg(n) ^ deg((0)) ^ qi (recall that qo ^ qi). 

• Conditionally on (x^), the random variables {deg(n) : d{wo,v) = 1} are indepen¬ 
dent. 

Suppose now that the result holds for some i, and let us prove it for i +1. Let S{i) = {n G 
BT{wQ,i) : d{wQ,v) = i}. Then as above the induction step follows from the following 
three facts: 

• BT{wo,i) :< B^i^Oyi) ^ i?Ti+i(oO) by using the induction hypothesis and that 
<ii ^ qi+i- 

• Conditionally on {x^), the random variables {deg(n) : n G S'(f)} are independent. 

• If n G S'(f), then deg(n) ^ q*. Indeed, 

— if n is of type R, then v = {w,£), with i > rriw By using (iii) above and 
the induction hypothesis, we obtain that deg(n) ^ deg(tc) ^ q*-!. 

— if n is of type L, there are two possibilities. First, ii v = (0,.^i,... ,£i), 

with £j < m((0, £i,..., for all 1 < J < i, then x^ has the same distri¬ 

bution as Xi (note that for simplicity, here we use the notation m{v) for m^,). 
Therefore, deg(n) ~ q^. Otherwise, there are indices 1 < k < i — 1 such that 
Ik > £i,..., £k-i))- Let j be the largest such index, and let tc = (0,..., £j-i) 

and w' = {w ,!,...,!) (with [i — j)’s 1). We now see that x^ has the same law as 
X(w,i )Ui... Ui_j and has the same law as x^Ui... Ui^j, where (f/j) is a sequence 
of i.i.d. uniform random variables in [0,1]. As £j > m{w), we get X{w,i-) > x^ 
and hence, ^ Xy. Since both v and w' are of type L, it follows from (ii) that 
deg(n) :< deg(tc') ^ (note that w' G Bt{wo,i)). 

This concludes the proof of the lemma. □ 

Proof of the upper bound. We first recall a key estimate on the survial probability of the 
contact process on (T 2 , o) = GW(qo, qi). Note that qo(/c) x k~^ and qi(/c) x with 

u = 2 + l/ip. In |MVY] . more precisely in Sections 6.1, 6.2 and 6.3, the authors proved 
that when the exponent u is larger than 5/2, there exist positive constants C = C{i') and 
R = R{v), such that 

(21) P(£T,(o,i^))<CA2^-^|logA|2 -^ 

where 

£t2(0) R) = {(O) 0) GA i?T2(o, Py X M+}, 

is the event that the contact process starting from o infects vertices outside Bf^o^ P). 
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Importantly, all their proofs involve only what happens inside the ball (o, R) with 
R = R{i') fixed. On the other hand, it follows from Lemma [3.II that 

(22) cqi(fc) < qR-i(/c) < 0(log fc)^"^qi(fc), 

with c and C independent of A. In fact the logarithmic term on the right-hand side does 
not make any difference in all the proofs in |MVY| (we refer to their Sections 6.2 and 6.3 
for more details) Q . Hence one can prove exactly as (l2T]) that 

(23) P(£T«(o,i?)) < logAI^-", 

with the same constant R = i?(z^) and a possibly different C = C{i'), and where 

R) = {(o, 0) -H- Ry X M+}. 

Thanks to Lemma 13.31 Bt{wo,R) ^ Bjj^{o,R). Hence it follows from fl2^ and the 
monotonicity property of the contact process that 

(24) P(£t(ico,/ 2)) < P(£T«(o,i?)) < CA^+^/pj iogA|-i/b^ 
where 

Ct{wo,R) = {(rcojO) -fA BT{wo,Ry x M+}. 

On the other hand, the contact process starting from Wq survives forever only if the virus 
infects vertices outside the balls Bt{wq,R) for all R. Therefore 

^ 0 Vf > 0) < P(£r(wo,/2)) < OAi+2/P| 

which proves the desired upper bound. □ 

Remark 3.4. In the case > 3, in |MVY) . the authors improved fl2ip as follows 

P(£T.(o,i?)) <CA2""'|logA|2(2-0^ 

for some positive constants C = C{iy) and R = i?(i^. A). Although in the case of (T^j, o) the 
exponent i/ = 2 -|- 1/^ is larger than 3, we can not apply this result, since here R = R{X) 
depends on A. Therefore the distribution q/j-i also depends on A and (|2^ does not hold 
anymore, so the proof in |MVY| can not be used here. 

3.2. Proof of the lower bound. In this part, we first estimate the probability that 
there is an infinite sequence of vertices, including wi, with larger and larger degree and 
a small enough distance between any two consecutive elements of the sequence. We then 
repeatedly apply Lemma 12.81 and 12.91 to bound from below the probability that the virus 
propagates along these vertices, and like this survives forever. To this end, we denote by 

(25) = 

with c* * as in Lemma 12.81 and 12.91 

1 

• In their proof, we need to replace the event {^° ^ 0 Vt > 0} by {(o, 0) O B{o, Ry x K+}. 

• In Section 6.3 of their proof, in order to estimate the probability of and Sf, take e[ = 
min(2i2 — 5,2)/2 instead of e'l = {2v — 5)/2. 
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Lemma 3.5. There is a positive constant c, such that for A small enough, 

P(AA) > 

where 


= {3(j£)£>i : ji = l,deg(M;jJ > 2^+V(-^)/^ > W > 1}. 


Proof. It follows from Markov’s inequality that for any k > 1, 

(26) ¥{Ui ...Uk> 


where {Ui) is a sequence of i.i.d. uniform random variables in [0,1]. 

Now recall that for any i, 

P(deg(M;i) = m + l + k\ xi) = ^ 

with a = m + 2mr + 1. Let C be some constant, such that for all > 1, 


T{k + a) 
T{a)k\ 


< Ck‘^-^ 


The existence of C is guaranteed by the fact that r{k + b)/r{k) x k^ when b is fixed. 
Then 


clxf 

(27) P(deg(t(;j) < m + 1 + (c/xf) \ xf) < ^ Ck°‘~^xf^ < Cc“, 

/c =0 

for any c > 0. Set ji = 1 and = M/f) for £ >2. Then dehne 

^fe = {xp < {A^p{X)/{cfj))-^/'^,deg{wj,) > 2^+V(A)/V’} 
for all £ > 1, where c is a positive constant to be chosen later. 

As 2^~^^cp{X)/tjj > A(p{X)/fj = ip{X)d{wj^,Wj^^f), we have 

CX) 

(28) A7Dp|A/;. 

i=i 


Since Xj^ is distributed as XiUi... applying fl26]) gives that 

F{xj^ < XiA-^^^) > 1 - 2 ( 4 -^/^^). 

Therefore 


(29) P {x,, < (4V(A)/(cV^))-'/’^ I M) > 1 - 2(4-^/^^). 

By using flTTI) with (2^“^c) instead of c we obtain that 

(30) P (deg(M;jJ > 2^+V(A)/'0 I Xj, < (4V(A)/(c'0))"^/^) > 1 - C(2^"^c)“. 
Then it follows from fl2^ and flHUj) that 

( OO \ CO CO 

Pi A/; I A/) > 1 - 2 ^ 4-^/^ - C ^(2i-^c)“ > 1/4, 

1=2 J 1=2 i =2 

provided c is small enough. We now estimate P(A/i). Let 

7 (A) = (4,,(A)/(c^))-‘''* 
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Recall that xi is uniformly distributed on [0,xo], with xq ~ and Uq «(| 0 ,l|)^ 
Therefore 


lP(aii < 7 (A)) = E 


min{ 7 (A),a:o} 


Xo 


> 7(A)P(xo > 7 (A)) 
= 7(A)(1-7(A)'/’‘) 


(32) > 7(A)/2, 

for A small enough. On the other hand, fl27|) gives that for c small enough 

(33) P(7Vi I xi < 7 (A)) > 1 - Oc“ > 1 / 2 . 

We thus can choose c such that the two inequalities in flTID and fl55]) are satished. Now it 
follows from (l28|) . (13T]) . and (l33|) that 

IP(V) > 7(A), 


which implies the result. 


□ 


Proof of the lower bound. By repeatedly applying Lemma 12.91 to the couple of vertices 
(tCj^, we obtain that 

00 

P(^i 7 ^ 0 Vt > T I is lit at some time T) > 1 — 2 y~~^exp(—c*A^ 2 ^'*~^y 9 (A)/' 0 ) 

£=1 

00 

>l-25^exp(-7(2'+i)|logA|/i/;) 

£=1 

(34) > 1/2. 

On the other hand, by using Lemma [2.81 (i), we have 

¥{wi is lit at some time T \ J\f, o is infected at time 0) 

(35) > cAE(l — exp(—c*A deg(tci)) | J\f) > cA/2, 

for some c > 0 (note that on J\f, we have c*Adeg(tci) > 7). Now the result follows from 
(IMj) . fi35|) and Lemma l375l □ 


4. Proof of Theorem O 


By using the self-duality of the contact process ([3]), we see that to prove (j2]), it is 
sufficient to show that 


(36) 

and 

(37) 


Pri 





i(fe”./a}) <««''*! logAI 

l({7i^<2})>cA‘+^'*|logAr 



1 - 0 ( 1 ), 


1 - 0 ( 1 ), 


for some positive constants c and C. We will prove these two statements in the next two 
subsections. 
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4.1. Proof of flMj) . For any n G Pn and i?, we define 

Cn{v,R) = {(n,0) GA BgS'^.RY X M+} 

and 

= l{Cri{v,R)). 

Theorem 12.21 yields that for any R>1 

(38) lim R)) = P(£r(wo, R)), 

n—^oo 

where n is a uniformly chosen vertex from Vn- By combing this with fIMl) we obtain that 

lim = 1 ) < logA|-i/’^, 

n—)-oo 

or equivalently 


(39) 


lim - W ¥JX„ = 1) < CA‘+=''*| logA|-‘«’. 

n^oo Tl * ^ 


V^Vn 


Now, let us consider the set 

Wn = {(n, v') e 14 X 14 : d{v, 4) >2R + 3}, 

with R = R{i') as in fl23|) . Since i? + 1 < 62 logn/(loglogn) for n large enough, Lemma 
12.61 implies that 

P4(n,4)^lFn) = o(n2). 

V,v'eVn 

On the other hand, if {v,v') G 114 then and X^i are independent. Therefore 
(40) ^ Cov(X^,X„/) = o(n^). 

V,v'eVn 

Thanks to fl5^ and fl4P]) by using Chebyshev’s inequality we get that 


(41) 


P„ <24A^+2/Vi|iog^|-i/h j 

\ V&Vr, J 


= 1 - 0 ( 1 ). 


Since the contact process on a hnite ball in the Polya-point graph a.s. dies out, 

lim P(£t(wo, Ry n 4 0}) = 0. 

t —^00 

Hence for any e > 0, there exists t^, such that 

(42) nCT{wo,Rrn{^rfy0})<e. 

For any n G 14, define 

x,^, = i{C^{v,Rrn{^iy0}). 

Then for n large enough such that tn > we have 

(43) 1({C. # 0}) < V, + X.,,. 

It follows from Theorem 12.21 and that 

lim E4X„,4 = P(£t(ito, RT n {47 y 0}) < e. 

n—)-oo 

By using this and Markov’s inequality we get that for n large enough, and for any r; > 0, 


(44) 


\ n ^ > h I < 

\ vev„ / 




nr] 


V 


V 
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By combining fl4T]) . (I4^ and (jH]), then letting e tend to 0, we obtain that 

^ ®}) S 3CA‘+W| log A|-‘« ) = 1 - 0(1), 

which proves fl36|) . □ 

4.2. Proof of fl37|) . This subsection is divided into three parts. In the hrst one, we will 
show that w.h.p. there are many vertices with large degree (larger than x*logn). By 
using on the other hand that the diameter of the graph is smaller than bi log n, we can 
deduce that if one of these large degree vertices is infected, then the virus survives w.h.p. 
for a time exp(c?T./(logn)^/'^), see Proposition 14.21 In the second part, we measure the 
density of potential vertices which are promising for spreading the virus to some of these 
large degree vertices. In the last part, we estimate the proportion of potential vertices 
which really send the virus to large degree vertices, getting this way (l37|) . 


4.2.1. Lower bound on the extinction time. Our aim in this part is to hnd large degree 
vertices as mentioned above. We then prove that if one of them is infected, the virus is 
likely to survive a long time. 

Lemma 4.1. Let x > 0 6 e given. Then there exists a positive constant c = c(x), such 
that An holds w.h.p. with 

An = {Gn contains cn/(logn)^'^*-^~^^ disjoint star graphs of size larger than xlogn}. 


Proof. Let £ G (0,1/3) be given, and let K = K[e) and be as in Lemma 12.41 Set 
On = (Mlogn)^/^^“^\ with M to be chosen later. Denote by 

A = {vi'.ie [n/an,2n/an] and xji G {O/i^p/i)} 


and 

8 = EeA{\A\> 9n/an], 

with 9, p as in Lemma [2.41 Recall that the events {fji G {9/i,p/i)} are independent and 
have probability larger than 29. Therefore (jl]) implies that w.h.p. 1^41 > 9n/an. Hence 
for n large enough 

¥n{8) >l-2e. 

We now suppose that 8 happens. In particular, — {j/k)A < ^{j/k)^ for all K{e) < 
j < k. Denote the elements of A as {uji, ...,Vj^} with i G [0n/a„, n/a„]. Then dehne 

Ai = {vj : u/2 < j < n}. 

We will show that all vertices in A have a large number of neighbors in Ai. First, it 
follows from dH]) that if j < k, then Vj and Vk are neighbors with probability of order 
Therefore there are positive constants c and C depending on 9, p such that for 
all Vj G A and G Hi, 

cn^~^ Cl 

(45) < ¥n{vj ^Vk\8)< 

n n 

Conditionally on {fjj), the events {{vj^ ~ Vk\}k&Ai are independent. Hence thanks to (j4]) 
we get that there are positive constants 9i, ci, Ci (depending on c and C), such that 


P 


n 



< 

Vk&Ai 


~ Vk) < Cia\ ^ 



> 1 


exp{-9ial ^), 
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or equivalently 

(46) Pn(£:i I > 1 - exp(-6'ia),"^), 

where 

= {c,al-^ < \B,\ < 

and 

Bi = {vk e : Vj^ ~ Vk}. 

Note that in this proof, the values of the constants 6i, ci and Ci may change from line to 
line. Now let us consider 


A 2 = Ai\ Bi and B 2 = {vk G A 2 : ~ Vk}. 

We notice that on Si fl S, the cardinality of A 2 is larger than nj2 — Cia]s^ > n/'i. Thus, 
similarly to flT6)) we can show that 

¥n{S2 I n £:) > 1 - exp(- 6 'ia),"^), 

where 

82 = {cials^ < \B2\ < Cia}~^}. 

Likewise for all 2 < 5 < ^, define recursively 


^s — 1 \ — Bg G Ag . Vj^ ~ 

T, = {ciay^ < \B4 < Cials^}. 

S — 1 

On T n n Si, we have > n/2 — sOia„“^ > n/4, and 

i=l 


Pn T, 


s—1 

sn nSi 

i=l 


> 1 - exp -(6*ia)j 


Hence 

Si\S^>l - nexp(-6'iay^)/a„. 

Taking M large enough such that Cia]s^ > xlogn and nexp{—6ials^) < 1 yields that 
(47) P„(|i?s| > xlogn 1 < s < I \ S) >1 — a~^. 



Moreover, by dehnition Bg A Bt = 0 for all s ^ t. Hence, all vertices in A have more 
than xlogn distinct neighbors. Finally, take c such that cn/(logn)^/*^^“^) < On/an, for 
instance c < In conclusion, we have shown that for any given e G (0,1/3), 

Pn(v4,„) > 1 - 25 - > 1 - 36, 


for n large enough. Since this holds for any e > 0, the result follows. 


□ 


To determine the constant x in the dehnition of An, we hrst recall that 

P„(H„) = l-o(l), 

where 

Bn = {d{Gn) < foilogn}. 

Hence to apply Lemma [2.91 to the large degree vertices exhibited in the previous lemma, 
we need 
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Moreover, at some point later it will be convenient to have also x > 3/(c*A^). So we let 



Then we let c* = c*(x*) and An be defined accordingly as in Lemma [4. II 

A set of vertices V = {tci,..., Wk} C Vn is called good if IS*^. \ \ > x* logn for 

all 1 < i < A;, where denotes the star graph formed by v and its neighbors. 

Let 1^ be a maximal good set i.e. |14^| = max{|l/| : 1/ C 14 is good}. 

Proposition 4.2. There exists a positive constant c, such that 

Pn 7^ 0 I ^0 n 14 * 4 0) = 1 - 0 ( 1 ), 

where Tn = exp(cA^n/(logn)^/’A). 

Proof. Thanks to Lemma [2.51 and l4.ll we can assume that d{Gn) < foilogn and llA'I > 
c*n/(logn)^/^^“^fi Assume also that at time 0 a vertex in Vf, say v, is infected. 

Due to the definiton of Vf, for any w ^ Vf, we can select from the set of tc’s neighbors 
a subset D{w) of size x* logn, such that D{w) fl D{w') = 0 for all w 4 w'. 

We say that a vertex w in I 4 ' is infested at some time t if the proportion of infected sites 
in D{w) at time t is larger than A/(16e) (the term is taken from |MMVY] L 

It follows from Lemma [2.81 (ii) that v becomes infested with probability tending to 1, 
as n —)■ cxo. Using Lemma 12.81 liiil and 12.91 Inote that \D{w)\ > {7/{c*X‘^))\\ogX\d{w,w')), 
we deduce that for any f > 0 and w E Vf, 

P„(n makes w infested at t + 2exp(c*A^x* logn) | v is infested at t) 

> 1 — 4exp(—c*A^x* logn). 

Therefore 

Pn(n makes all vertices in Vf infested at f + 2exp(c*A^x* logn) | v is infested at t) 

> 1 — 4nexp(—c*A^x* logn) 

> 1 — n“^, 

where for the last inequality we have used that c*A^x* > 3. Now if all vertices in Vf are 
infested at the same time, then the proof of Proposition 1 in cm shows that the virus 
survives a time exponential in deg(n). More precisely, let In,t be the number of 

infested vertices in Vf at time t. Then there is a positive constant rj, such that for all 

k<\v:\, 

Pn iln,sk > A;/2 I ln,0 > k) >1 - S^^, 

where Sk = exp(riX^kx* logn). The result follows by taking k = c*n/(logn)^/*'^“^^ □ 

4.2.2. Density of potential vertices. In this part we will estimate the proprotion of poten¬ 
tial sites from where the virus can be sent with positive probability to a vertex at distance 
quite small (of order (loglogn)^) and with large degree (larger than x* logn). 

This proportion approximates the probability that there is an infection path from the 
uniformly chosen vertex n to a vertex with degree larger than x* logn. To bound from 
below this probability, based on the idea of Section 3.2, in particular Lemma [3.51 we find 
a sequence of vertices starting from a neighbor of u and finishing at a large degree vertex, 
satisfying the hypothesis of Lemmafor spreading the virus from u to the ending vertex, 
see Lemma [4.41 
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Here are just some comments on the order of magnitude above. First, if a vertex with 
degree larger than k* logn is infected, then w.h.p. it will infect a site in V* , and then we 
can conclude with Proposition 14.21 Secondly, (log log is the distance from a potential 
vertex to a large degree vertex and is much smaller than the typical distance between 
two different potential vertices. Hence the propagation of the virus from these potential 
vertices to their closest large degree vertex are approximately independent events. 

Set 

Rn = (log log n)^. 

For w G 14 , define koiw) hy w = Vko{w), and for i > 1 define ki{w) by the conditions 
ki{w) < ki-iiw) and Vki{w) ~ Vki.i{w) (note that the choice of ki{w) is not necessarily 
unique). We define also 

R-niw) = {ko{w) > n/ logn} n {4+1 (m) > ki{w)/ log ki{w) > VO < z < Rn}- 

Lemma 4.3. There is a positive eonstant Oq, such that for all 9 < 9 q, for all e G (0, 1/2), 
and for any vertex w, we have 

[ max deg(n) > 9e^^{n/ko{w)Y~^ \ n'H„(tc) ] > 1 — e~^\ 

\veBcr,iw,i) J 

for all i < Rn, and 

Pn (Rniw) I ko{w) > n/\ogn,SY = 1 - o(l/log logn). 


Proof. Let us begin with the second assertion. Due to the construction of Gn, if ki{w )/ log ki{w) > 
K{e), then 


¥niki+i{w) < ki{w)/log ki{w) I Se,ki{w),{(pt)) = 


Ski{w)-1 


< 


1 + 


1 - c \logki{w)_ 

Hence for all i < Rn = (log log n)^, 

Pn {ki+i{w) > n/(logn)*+^ | Se,ki{w) > n/(logn)*+^) = 1 - o((logn)"^/^). 


which proves the result by using a union bound. 

For the first inequality, we claim that there is a positive constant cq, such that for any 
c < Co, there exists d = d (c) > 0, such that for all i < Rn 
(i) Pn {k[i/2]{w) < e"“/co(u;) | SenRniw)) > 1 - 


(ii) Pn (3j G ii/2,i) : Ykj{w) > c/kj{w) \ SeORniw)) > 1 - e 


(hi) P„(deg(nfc) > d{n/ky ^ \ Yk > c/k,Se) > 1 - exp(-c'(n//c)^ ^), for any Vk G K- 


From these claims we can deduce the result. Indeed, (i) and (ii) imply that with probabil¬ 
ity larger than 1 — 2exp(—cV) there is an integer j G {i/2,i) such that kj{w) < e~'^^ko{w) 
and ifkjiw) 4 c/kj{w). Then (hi) gives that deg(nfc^.(^)) > d{n/ kQ{w)Y~^ with 
probability larger than 1 — exp(—Hence the result follows by taking 6 small 
enough. 

To prove (i), similarly to Lemma [2.61 we consider 

Xj{w) = l{{kj{w) < kj-i{w)/2}) and Rj{w) = a{kt{w) ■ t < j) V a{{^pt)). 
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On T-Lniw), we have K{e) < y/n < kj{w) for all j < Rn- Then by using the same argument 
as in Lemma [2.61 we obtain that on Rniw) O Ss, 


and 

(49) 




S[kj{w)/ 2 ] S[kj(w)/log kj{w)] 

^kj (ui) —1 


> p 


[i/2\ 


P„ j '^^Xj{w) > ip/A j > 1 — 2 exp(— 2 p^/ 16 ), 
V=i 


for some constant p > 0. Since k\i/ 2 ]{w) < 2~'‘^/‘^ko{w) as soon as Y^j=i 
the result follows from fl49|) . 

We now prove (ii). Let 9 be the constant as in Lemma [2.41 (v). Fix some j G (z/2,i) 
and set k = kj{w) — 1 and (. = [kj{w)/\ogkj{w)]. On Rniw) OTg, we have k > kj^i{w) > 
i > y/n > K{e), and 


Pn i'i/kj+pw) > 0/kj+i{w) \k,i) S - S ^ > 9/t) I k,t 

, ^ ^ t=t+\ J 

(TT^ ^ I 


> 


> 


n-' 


(50) 


{lRe)k^ 

6n^ 

(1 + e)k^ 

6n^ 

{l + e)k^ 

>e/A, 


> 


> 


t=£+i 

( k 

6 ^Pt\ky 

t=£+l 

E„ {Sk-S, I fc,£) 

[(1 - e) (k/n)^ - (1 + e) {(./nY] 


for £ G (0,1/2), where for the second inequality we have used Lemma [2.41 (v). Now (ii) 
follows from the same argument as (i). Finally, (hi) can be proved as ([46]) . □ 


Lemma 4.4. Let u be a uniformly chosen vertex from W- Then there exists a positive 
constant c, such that 

Pn(Af) > cA99(A)-1/^, 

where 


M. = {3ta G Bg„{u, Rn) : deg(tc) > x* logn} O {(^“) makes w lit inside Bcniu, Rn)}- 

Proof. Define fco by for i > 1 define ki by the conditions R < R-i and 

Vki ~ 'Vki^i- Let us denote Ui = Ufci and define also 

Rn ■= Rniui) = [ki > n/logn} O {fcj+i > ki/logh > VI < i < + 1}. 

In this proof, we assume that £ = o(A(p(A)“^/’^). Similarly to Lemma [T^ bv using that ko 
is chosen uniformly from {1,... ,n}, we have P„('H„ | = 1 — o(l/loglogn) and hence 

T’niTe n Rn) = 1 — o(Ay9(A). We assume now that these two events happen. 
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We recall the claim (iii) in the proof of Lemma l473l there is a positive constant cq, such 
that for any c < cq, there exists d = c'(c) > 0, such that 

(51) P„(deg(Tfc) > d{n/kf~^ \'dk>c/k)>l- exp(-c'(n/fc)^"^), 

for any G W- Let us consider 

Ml = {ki < n/d{\)], 

where 7 (A) = (4(p(A)/c'6*^)^/^“^, with 9 a small enough constant (smaller than 9q as in 
Lemma 12.41 and 141^ and than cq), and d = d{6). Define 

M 2 = Ml n {dl < i < R'^ 3we : d{ui,Wi) < re,deg{wi) > ip{X) exp{6re)}, 

where ri = Ai/6‘^ioTl<i<R'^:= 9‘^Rn/8. 

By using Lemma 14.31 for ui we get that for any i < R'^ 


P. 


max deg(T) > 9e^^^{n/kiY ^ ] > 1 — 


,-9re 


If ki < n/ 7 (A), then 6 *exp( 6 'r£)(? 7 ,/fci)^“^ > 6 'exp( 6 'r£) 7 (A)^“^ > <p(A) exp( 6 *r£). Thus 
P„ (3 t : d{v, Ml) < r^, deg(M) > <p(A) exp( 6 *r£) | A4i) > 1 — . 

Hence 

K 

(52) Pn(A ^2 I -Ml) > 1 - ^exp(-4£/0) > 1 - 2exp(-4/0). 

i=i 

Dehne 

M'i = -Ml n {deg(Mi) > 4(p(A)/6*^}. 

Similarly to (1501) . we can show that 

Pn (V'fci > 0/ki I ki < n/ 7 (A)) > 6>/4. 

It follows from fl5T]) and the fact that c' 7 (A)^“^ = 4(p(A)/6*^, that 

P„ (deg(Mi) > d{n/kif~^ \ ki < n/d{X),'dki > d/ki) > 1 - exp(-4(p(A)/6'^) > 1 / 2 . 
From the last two inequalities we deduce that 

Pn(-M3 I -Ml) > 9/8. 

Combining this with fl5^ we obatin that 

(53) Pn(-M 2 n Mu I Ml) >9/8-2 exp(-4/9) > 9/16. 

We now bound from below P„(A4i). Observe that 

^[n/ 7 (A)]l({fco > n/d(X)}) 


Pn (ki < n/-/(A) ko,((fj)]> 


> j(A)-’^ 


dko-l 

k(^^ 


n 


l({fco > m/7(A)}). 


Since k^ is distributed uniformly on {1,..., n}, we get 


En {{kii/n)'’^l{{kii > n/ 7 (A)})) = - V (fc/n)^ x 1. 

n 


k=nl^(\) 
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Therefore 

P„ (Ml) > ^(A)-* > 4=(A)-W 

This and flS5]) give that 

(54) Pn(-M2nA43) 

Observe that on A42 O A^s, we have deg(ni) > Lp{\)ri > ip{\)d{ui,wi) and 

deg(w;£) > (y9(A) exp(6'r£) > 2(p(A)r£+i > ^p{X)d{w(,,we,+i) 

for all 1 < -^ < R'^. In other words, Ui and the vertices {w^) satisfy the condition in 
Lemma 12.91 and thus applying this lemma inductively yields that 

is lit inside Bg„{ui, Rn) \ M 2 O A43,Mi is lit) 

R'n 

> l-^exp(-c*AV(A)e®"0 

e=i 

(55) > 1. 

Similarly to fl55D . the probability that (,^“) makes Ui lit is of order A. It follows from this 
and fl55]) that 

(56) Pn((C) makes wr>^ lit inside Bg„{u, Rn) \ M 2 O A 43 ) > A. 

In addition, deg(t(;/j^) > x*logn. Therefore 

M D A42 n A43 n {(^“) makes wr/^ lit inside Bg^{u, Rn)}- 
Combining this with fl541) and fl56|) gives the result. □ 


4.2.3. Proof of fl37)) . For any v G 14, we dehne 
Yn = 1({3m; G Bg„{v, Rn) ■ deg(M;) > x* logn} O {(^(') makes w lit inside Bg„{v, Rn)}) 
and 

Z„ = Y„U.{ik^‘^}). 

where is as in Proposition 14.21 Then 

5^ z„ < 5^ 1(U?.„ # 0}), 

veVn V&Vn 

and thus fl37|) follows from Lemma [4.51 below and an application of Markov’s inequality. 
Lemma 4.5. The following assertions hold: 

(i) Pn ( ^ > cA 9 ?(A)“^/'^ I = 1 — 0 ( 1 ), for some c> 0, independent of X. 

\ vev-n J 

(ii) P„ [Z-u = 1 I 14 = 1 ) ^ 1 , as n —)■ 00 uniformly in v ^ I 4 . 

Proof. For (i), let e G (0,1/2) be given. We have to show that the probability in the 
left-hand side is larger than 1 — 2^ for n large enough. First, Lemma [4.41 implies that 

p„(n = 1) > a,,(a)-w. 


or equivalently 


1 


E = 1) ^ V(A) 


1/p 


n 
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Using Chebyshev’s inequality, the result follows from this and the following claim: on 8^ 

(57) ^ Cov(U;, F„/) = o(n^). 

V,v'&Vn 

To prove it, we consider 

Vn = {{vi,Vj) : i,j > n/logn,d{vi,Vj) > + 3}. 

Since + 1 < ^2 logn/(log logn) for n large enough, it follows from Lemma [2.61 that 

(58) P„((n,n') ^V„) = o(n2). 

V,v'GVn 

On the other hand. Lemma [4.31 gives that ii i > n/ logn, then on 8^ 

P„(3u; e Bc^iyuRn) ■ deg(M;) > K*\ogn) = 1 - o(l/loglogn). 

Moreover, given the graph G„, 17 and 1^/ only depend on the Poisson processes defined 
on the vertices and edges on the balls Bg„{v, Rn) and Bg„{v', Rn) respectively. Hence on 
8e for all (n,n') G Vn, 

(59) Cov(U;,17/) = o(l/loglogn). 

Now (157|) follows from (155]) and (155]) . 

We now prove (ii). If 17 = 1, then there exists a vertex w such that deg(tc) > x* logn 
and w is lit at some time. Besides, on Bn the diameter of the graph is bounded by bi logn 
w.h.p. Hence similarly to Lemma 12.91 we can show that on Bn 

Fn{w infects a vertex in IG ) > 1 — exp(—logn). 

If one of the vertices in V* is infected, it follows from Proposition 14.21 that w.h.p. the 
virus survives up to time T„. Hence we obtain (ii) by using that Bn holds w.h.p. □ 


Remark 4.6. Using Proposition 6.2 in |CS| and the facts that is connected and 

d{Gn) = G(logn), we can obtain another metastability result. Let be the extinction 
time of the contact process with infection rate A > 0 starting from full occupancy. Then 
the following convergence in law holds 

Tn (C) 




[Tn 


^( 1 ), 


with 8{1) an exponential random variable with mean one. 

As mentioned in the introduction, the bound on tn in Theorem 1 1.1 1 is nearly optimal, but 
by Remark 6.4 in |CS) one could improve it if one could prove that w.h.p. > exp(cn), 
for some c > 0. We expect the last assertion to hold like in the case of the configuration 
model |MMVY1 ICS] for instance. 
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